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INVENTOR: Sanjay Khanna 

Technique for Fast and Efficient 
Internet Protocol (IP) Address Lookup 

BACKGROUND OF THE INVENTION 

* 

Field of the Invention 

The present invention relates to distributed computing, and deals more particularly with a 
method, system, and computer program product for quickly and efficiently looking up IP (Internet 
Protocol) addresses, for example in routing tables used by network routers. 

Description of the Related Art 

Business and consumer use of distributed computing, also commonly referred to as 



RSW9-2000-0050-US1 



-1- 



1 / 

1 * 

network computing, has gained tremendous popularity in recent years. In this computing model, 
the data and/or programs to be used to perform a particular computing task typically reside on 
(i.e. are "distributed" among) more than one computer, where these multiple computers are 
connected by a network of some type. The Internet, and the part of the Internet known as the 
5 World Wide Web (hereinafter, "Web"), are well-known examples of this type of environment 
wherein the multiple computers are connected using a public network. Other types of network 
environments in which distributed computing may be used include intranets, which are typically 
private networks accessible to a restricted set of users (such as employees of a corporation), and 
extranets (e.g., a corporate network which is accessible to other users than just the employees of 
10 || the company which owns and/or manages the network, such as the company's business partners). 

H The Internet Protocol (IP) is used to enable the interconnected networks of the Internet to 

communicate with each other. In the most commonly used version of IP, which is known as 
□ "IPv4", the network address of each sender and receiver of information is specified as a 4-byte 
(32-bit) number. The leftmost bits of an IP address uniquely identify the network in which a 
1 5 * 1 particular device is located, and the rightmost bits uniquely identify the device within that 

network, The bits comprising the network identification are commonly referred to as the network 
number or network prefix of an IP address, while the rightmost are commonly referred to as the 
local address or host address. 
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Originally, IP addresses were divided into 4 classes, referred to as Class A, Class B, Class 
C, and Class D. In this addressing scheme, the bit settings in the leftmost 4 bits of the 32-bit 
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address identify the class for a particular IP address. Each different class uses a different 
boundary for distinguishing which part of a 32-bit address is considered to be the network prefix, 
and which part is considered to be the local address. A network address from a Class A network 
uses 7 bits for the network prefix and 24 bits for the local address. A Class B network uses 14 

5 bits for the network prefix and 16 bits for the local address. A Class C network uses 21 bits for 
the network prefix and 8 bits for the local address, and a Class D network uses 28 bits as a 
multicast address. The 32-bit addresses are commonly expressed using what is known as "dotted 
quad" notation, where each 8-bit byte of an address is converted to a decimal representation and 
the 4 decimal numbers are then written as a string separated by periods. Thus, the dotted quad 

10 j\ representation of a Class A network address specifies the network number as the first decimal 
gi number, followed by 3 decimal numbers identifying the local address; the dotted quad 
^ representation of a Class B network address, on the other hand, can be interpreted as 
"network.network.local.locar. 

'•£3 

ol A 3~level addressing scheme may alternatively be represented within the 32 bits, where the 

15 CI additional level is used to group local addresses into subnetwork, or subnet, addresses. The 

combination of the network prefix with the subnet address is referred to as an "extended network 
prefix". A subnet mask, which is a 32-bit number specified as a series of contiguous 1-bits on the 
left and 0-bits on the right, is used to indicate where the extended network prefix ends and the 
local address begins. 
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The gateways and routers which are responsible for routing data packets through a 
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distributed network must store information about the path to use in order to route the packets 
destined for a particular address. This information is stored in routing tables (sometimes referred 
to as a "routing cache"). Use of subnet addressing enables reducing the number of entries in a 
routing table, because routers and gateways which are external to an organization's private 
5 network need only contain a routing table entry for the organization's network address: the 
organization's internal routers then handle the routing among subnetworks using the subnet 
address specified in a data packet. 

Several problems were encountered with the 4-class addressing scheme as distributed 
computing gamed in popularity. It appeared as though the range of available address numbers 
10 01 would soon be exhausted, and many of the addresses which had already been assigned were not 
^ being used, due to the inflexible boundaries dictated by this addressing scheme. 

'iS r.' 

r I A technique known as Classless Inter-Domain Routing (CEDR) was developed to address 

Q 1 these problems. In CIDR, a network mask value is used to determine the boundary between 
P network addresses (including the subnet address) and the local address, without regard to any 
15 notion of class structures. CBDR requires that routers and gateways use a consistent, longest- 
match algorithm for forwarding data packets. In this algorithm, that part of the destination 
address in a data packet which is identified as being the network address (using the network 
mask) is compared to entries in a routing table (where the table entries are generally also specified 
in terms of a network mask) to locate the path to be used for forwarding the packet. If more than 
20 one routing table entry matches the destination address, then the more specific entry (which is the 
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entry having the longest network prefix) must be used. 

Even though CIDR provides interim relief for address assignment, 32 bits is deemed to be 
insufficient for supporting the future growth of distributed computing. Thus, a newer addressing 
scheme known as "IPv6" has been defined which uses 128-bit addresses. This addressing scheme 
has not yet been widely implemented. 

Routers and gateways (referred to hereinafter as routers, for ease of reference) must be 
able to quickly evaluate the IP address in a data packet in order to determine how to route the 
packet while providing an acceptable level of performance and throughput. As link speeds are 
increasing, the number of IP packets which a router is required to process per second is becoming 
very high. One critical factor in the router's performance and throughput is the route lookup 
technique used with the routing tables. 

Most implementations of routing tables today use radix trees. Radix trees require a 
significant amount of programming logic, and expenditure of a significant amount of computing 
time in traversing the trees to find a particular route. Furthermore, radix trees cannot exploit a 
multi-processor (MP) approach wherein the computing task is shared among processors. Another 
existing technique is use of sorted linked lists. Linked lists have well-known performance 
problems, and are also not MP-exploitable. Some existing implementations use hash tables along 
with sorted linked lists. This approach provides performance which is significantly better than 
linked lists alone, but still does not provide an optimal (nor an MP-exploitable) solution. A 
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technique designated "DIR-24-8-BASIC" was proposed by Pankaj Gupta et al. at Infocomm 
1998, where two separate routing tables are used: one table for routes which are less than 25 bits 
long, and a different table for routes which are 25 bits or longer. This technique, however, 
assumes that most routes have prefixes of 24 bits or less and is therefore thought by the inventor 
5 of the present invention to have a rather narrow focus, (A copy of this conference paper may be 
found on the Internet at address http://www-cs-students, stanford.edu/~pankaj/paps/ 
Infocom98.pdf.) A technique known as "Multi-Protocol Label Switching" (MPLS) has also been 
proposed, where this technique would replace the longest-prefix match approach with a simple 
direct lookup. This approach, however, would require adoption of new protocol standards, and 
10 sj i thus is not easily nor quickly adaptable into the established distributing computing infrastructure. 

,11 In addition to the need to inspect IP addresses for routing purposes, there are a number of 

'is; j? 

new servers or gateways which provide a specific service based upon the IP address of the client. 
□ Examples include firewalls, IPSec (IP Security) gateways, and Dynamic Host Configuration 
0; Protocol (DHCP) servers. (For example, DHCP servers, which manage the assignment of IP 
1 5 addresses to hosts, generally function differently depending on whether the host is a client or a 

server.) The performance of these servers and gateways will be constrained by the time it takes to 

inspect the IP address and determine which service to provide. 

Accordingly, what is needed is an improved technique for evaluating or interpreting IP 
address values. 

20 
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SUMMARY OF THE INVENTION 

An object of the present invention is to provide an improved technique for evaluating or 
interpreting IP address values, 

Another object of the present invention is to provide an IP address lookup technique 
5 which is MP exploitable. 

It is another object of the present invention to provide a technique which enables IP 
address lookup time to be nearly constant, regardless of the particular address value. 

0| Yet another object of the present invention to provide a technique which enables IP 

":! 

10 address lookup to occur with minimal locking requirements. 

r j A further object of the present invention to provide a technique for IP address lookup 

ill which can be used with CIDR. 

Still another object of the present invention to provide an IP address lookup technique 
15 which easily extends to use with IPv6 addresses. 

Other objects and advantages of the present invention will be set forth in part in the 
description and in the drawings which follow and, in part, will be obvious from the description or 
may be learned by practice of the invention. 
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To achieve the foregoing objects, and in accordance with the purpose of the invention as 
broadly described herein, the present invention provides a computer program product, a system, 
and a method for fast and efficient address lookup for an address comprised of a plurality of 
address components and wherein each address component is deemed to be more significant than 
5 its next-sequential neighboring address component. In a preferred embodiment this technique 
comprises: creating a plurality of arrays comprising an array for each of the address components, 
wherein each array comprises a plurality of entries which are indexed using values of the address 
component for which the array was created; and retrieving the stored information associated with 
a selected address value from the plurality of arrays. Creating the plurality of arrays preferably 
10 *1| further comprises: obtaining a particular address value to be represented in the plurality of arrays; 
□I obtaining a bit mask associated with the particular address value; indexing into a highest-order 
' % l one of the arrays using a most-significant component of the particular address value as an index 
element; setting a flag associated with the index element to on if the bit mask indicates that the 
r | next-sequential neighboring address component is considered significant, and setting the flag to 
15 fl ! off otherwise; and repeating the indexing and setting while the bit mask indicates that the next- 
□ sequential neighboring address component is considered significant, and (1) storing information 
associated with the particular address value in a storage or memory location and (2) setting a 
pointer field associated with the index element to point to the storage or memory location, 
otherwise. Retrieving the stored information preferably further comprises: obtaining the selected 
20 address value; obtaining a selected bit mask associated with the selected address value; indexing 
into the highest-order one of the arrays using the most-significant component of the selected 
address value as the index element; and determining that no result is available if the index element 
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has no stored information, and continuing otherwise. The continuing further comprises: checking 
the flag associated with the index element; and returning the stored information from the storage 
or memory location pointed to by the pointer field when the flag is set off or repeating the 
indexing and determining when the flag is set on. 

The repeating preferably further comprises setting a use count associated with the storage 
or memory location to a number which represents a count of the array entries which point to this 
storage or memory location when the next-sequential neighboring address component is not 
considered significant. 

The stored information in the memory or storage location may comprise an associated bit 
mask, and the retrieving may further comprises resolving a collision. Resolving the collision 
further comprises: comparing the selected address value to each bit mask associated with the 
stored information from multiple storage or memory locations, yielding a plurality of bit mask 
results; and selecting a collision result using that one of the bit mask results which both (1) 
matches the selected address value according to the selected bit mask and (2) has the longest 
associated bit mask. 

The address may be an Internet Protocol (IP) address. In this case, it may be an IP 
version 4 address, and thus there are 4 components in each IP version 4 address and 4 arrays in 
the plurality. Or, the IP address may be an IP version 6 address, and thus there are 16 address 
components in each IP version 6 address and 16 arrays in the plurality. 
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The present invention will now be described with reference to the following drawings, in 
which like reference numbers denote the same element throughout. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram of a networked computing environment in which the present 
5 invention may be practiced; 

Figure 2 provides an overview of the technique used by the preferred embodiment of the 
present invention when storing and subsequently interpreting 32-bit IP addresses; 

K Figure 3 illustrates an example of adding routes to routing tables, according to the 

I i preferred embodiment of the present invention; and 

10 O Figures 4A - 4C illustrate examples of looking up routes in the routing tables created by 

-55! '■, 

* I the example of Figure 3, according to the preferred embodiment of the present invention, 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

Fig. 1 illustrates a data processing network 40 in which the present invention may be 
practiced. The data processing network 40 may include a plurality of individual networks, such as 
1 5 wireless network 42 and network 44, each of which may include a plurality of individual 

workstations 10. Additionally, as those skilled in the art will appreciate, one or more LANs may 
be included (not shown), where a LAN may comprise a plurality of intelligent workstations 
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coupled to a host processor. 

Still referring to Fig, 1, the networks 42 and 44 may also include mainframe computers or 
servers, such as a gateway computer 46 or application server 47 (which may access a data 
repository 48). A gateway computer 46 serves as a point of entry into each network 44. The 
5 gateway 46 may be preferably coupled to another network 42 by means of a communications link 
50a. The gateway 46 may also be directly coupled to one or more workstations 10 using a 
communications link 50b, 50c. The gateway computer 46 may be implemented utilizing an 
Enterprise Systems Architecture/370 available from the International Business Machines 
i|I Corporation (IBM), an Enterprise Systems Architecture/390 computer, etc. Depending on the 
10 [ M application, a midrange computer, such as an Application System/400 (also known as an AS/400) 
] : may be employed. ("Enterprise Systems Architecture/370" is a trademark of IBM; "Enterprise 
, Systems Architecture/390", "Application System/400", and "AS/400" are registered trademarks 
£ I of IBM.) These are merely representative types of computers with which the present invention 
y * may be used. One or more routers (not shown) are typically used to route traffic among 
15 ^ - networks. 

The gateway computer 46 may also be coupled 49 to a storage device (such as data 
repository 48). Further, the gateway 46 may be directly or indirectly coupled to one or more 
workstations 10, and servers such as gateway 46 and application server 47 may be coupled to 
other servers such as server 43. 
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Those skilled in the art will appreciate that the gateway computer 46 may be located a 
great geographic distance from the network 42, and similarly, the workstations 10 may be located 
a substantial distance from the networks 42 and 44. For example, the network 42 may be located 
in California, while the gateway 46 may be located in Texas, and one or more of the workstations 
5 10 may be located in New York. The workstations 10 may connect to the wireless network 42 
using a networking protocol such as the Transmission Control Protocol/Internet Protocol 
("TCP/IP") over a number of alternative connection media, such as cellular phone, radio 
frequency networks, satellite networks, etc. The wireless network 42 preferably connects to the 
gateway 46 using a network connection 50a such as TCP or UDP (User Datagram Protocol) over 
10 1 IP, X.25, Frame Relay, ISDN (Integrated Services Digital Network), PSTN (Public Switched 
W Telephone Network), etc. The workstations 10 may alternatively connect directly to the gateway 
"* I 46 using dial connections 50b or 50c. Further, the wireless network 42 and network 44 may 

connect to one or more other networks (not shown), in an analogous manner to that depicted in 
O Fig. 1. 

1 5 * ; In the preferred embodiment, the present invention is implemented in computer software, 

although a solution implemented partly or entirely in hardware is also within the scope of the 
present invention. When implemented in software, the programming code which embodies the 
present invention is typically accessed by a router (or, equivalently, by a server or gateway such as 
server 43, gateway 46, and/or server 47) from long-term storage media of some type, such as a 

20 CD-ROM drive or hard drive. The software programming code may be embodied on any of a 
variety of known media for use with a data processing system, such as a diskette, hard drive, or 
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CD-ROM. The code may be distributed on such media, or may be distributed from the memory 
or storage of one computer system over a network of some type to other computer systems for 
use by such other systems. Alternatively, the programming code may be embodied in the memory 
of the appropriate device. The techniques and methods for embodying software programming 
code in memory, on physical media, and/or distributing software code via networks are well 
known and will not be further discussed herein. 

The present invention provides a flexible, efficient, and fast technique for looking up IP 
addresses. This lookup function will be discussed herein in terms of looking up IP addresses in 
routing tables, although this is for purposes of illustration and not of limitation. Alternatively, IP 
addresses may be used for looking up other types of information, such as information which 
determines a particular service to be performed based upon the value of a particular address (as in 
the DHCP example previously described). Also, it should be noted that this technique is 
adaptable to addresses other than IP addresses, provided those other addresses have similar 
characteristics (such as use of multiple address components, etc.). 

In the preferred embodiment, a separate array is used for each 8-bit component of an IP 
address. When used with IPv4 addresses, 4 such arrays are needed; an implementation used with 
IPv6 requires 16 arrays. Preferably, each array has 2 s = 256 distinct entries, such that each 8-bit 
address component is used to index to one of these entries. The preferred embodiment will be 
described using the dotted quad notation, which in an IPv4 example may be depicted in the form 
"a.b.ad" The preferred embodiment also assumes that IP addresses have an associated mask, as 
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previously discussed. The least specific mask value is 0.0.0.0, and the most specific mask value is 
255.255.255,255. 

As shown in Fig. 2, 4 separate arrays 200, 220, 240, 260 are used in the preferred 
embodiment when interpreting 32-bit IP addresses. For a particular address a.b.c.d, the first 8-bit 
5 component ("a 55 in this example) is used to index into the first array 200. The remaining 

components are used to index into the other arrays 220, 240, 260, as shown in Fig. 2. When 
performing an example lookup function "F' on an address a.b.c.d, the illustration in Fig. 2 shows 
information 270 (such as a control block for a route "RTE") being located as a result. This 
i|I technique is MP-exploitable because the array lookups may be performed by separate processors, 
10 if desired. (For example, a separate processor could be used for each array, or one processor 
'A could be used for looking up the first and then the second components while another processor 
simultaneously looks up the third and then the fourth components.) 



^ In the preferred embodiment, each of the 256 entries in each array is 8 bytes long, where 

1 a * the first 4 bytes may be used for status or informational purposes and the remaining 4 bytes may 
15 be used as a pointer field which points to a control block. Alternatively, the pointer field may 
point to the head of a chain of control blocks, where the control blocks in the chain are then 
processed (for example) sequentially. Or, the pointer field may point to an intermittent structure 
such as a hash table. The hash table approach will be useful in scenarios where there is a 
significant possibility of running into long chains of control blocks, and thus the hash table is used 
20 to enable accessing the multiple control blocks pointed to by a single pointer field more efficiently. 
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Techniques for using hash tables to improve access efficiency are well known, and will not be 
described in detail herein. 

Note that the preferred embodiment uses 8-byte entries, which are convenient when using 
CDS (Compare Double and Swap) operations an MVS (Multiple Virtual Storage) system. 
Alternatively, array entries of other lengths can be used without deviating from the scope of the 
present invention, and those entries may be divided in ways other than using 2 equal-sized 
divisions. Furthermore, the array entries may contain information which is interpreted differently 
than that used to illustrate the operation of the preferred embodiment. These alternatives are 
within the scope of the present invention. 

Regardless of the number of arrays in use (i.e. preferably 4 for IPv4 and 16 for IPv6), the 
length of entries in the arrays, and the interpretation of the contents of those entries, the most 
specific lookup answer found when using the present invention will be in the last byte array used, 
and the least specific entry will be in the first byte array. 

When using 4 arrays which are indexed by 8-bit numbers and therefore have 256 entries, 
each of which are 8 bytes long, each array requires 2,048 bytes and the total memory or storage 
required is therefore a total of 8, 192 bytes. The cost of performing a lookup operation is, in the 
general case, 4 lookup operations plus the cost of walking the chain or performing the hash 
search, as appropriate. Thus, the cost of looking up a particular IP address is nearly constant, 
regardless of the address value. For maintaining routing table entries, MVS routing table 
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techniques of the prior art require 2 locks to be held when adding or deleting entries in routing 
tables. With the present invention, however, the additions and deletions can be done using CDS 
and the implementation can be very much lock-less (that is, the CDS instruction allows updating 
fields without holding any locks since it serializes itself against any other threads). 

5 The preferred embodiment of the present invention will now be discussed in more detail 

with reference to the examples illustrated in Figs. 3 and 4. 

Fig. 3 illustrates an example of adding routes to routing tables, according to the preferred 
41 embodiment of the present invention. In this preferred embodiment, the information portion of 
01 each array entry includes a bit or flag which is shown in the figures as "MFlag" and referred to 
10 ";f herein as the "more flag". When set off, the more flag indicates that there is no more information 
stored in the subsequent lower-level arrays (i.e. the arrays for the next 8-bit components of the 
q address); when set on, the more flag indicates that the next lower-level array contains more 
□i information. 

Suppose a routing table entry is to be added for the route 9.67.96.0, using the network 
15 mask 255.255.255.0. This route is referred to as "Rl w in Figs. 3 and 4, and its mask is referred to 
as "GF\ The value of mask Gl indicates that the first 3 of 4 address components are significant 
when determining routing using route RL Because there is more information than just the first 
address component (having the value 9) in this routing table entry, the more flag 302 associated 
with index entry 9 (shown as element 301) in the first array 300 is set on. Similarly, the more flag 
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322 associated with index entry 67 (element 321) in the second array 320 is set on. Because there 
is no further significant information, according to mask Gl, after the third address component, the 
mask flag for the value 96 (shown at 342 for index entry 341) is initially set to off. (Note that the 
value of more flag 342 is shown in Fig. 3 as being set on: this value is changed from its initial 
5 setting when processing the next example described below.) The routing table information 

associated with this route is stored in a control block 344, which is pointed to by the pointer field 
343. In the preferred embodiment, a "use counf ' is associated with each control block, indicating 
how many different routes use the information in the control block. Because the Gl mask is a 
specific value for all 24 bits which comprise the first 3 address components, only 1 route uses this 
10 -7\ Rl information and thus the use count 345 is set to L With this Rl entry in the routing table, a 
81 route lookup for IP address 9.67,96.x (where "x" represents an arbitrary value) using a mask 
^ length of 24 bits would then access element 301 in array 300; it would see from more flag 302 
^ that there was more information to be learned by accessing element 32 1 in array 320; it would see 
h from more flag 322 that array 340 should be checked; it would access array 340 at element 341; 
15 q I and it would then determine from more flag 342 (which would be set to off at this point) that 
- * there was no more information to be learned from accessing the last array 360. Thus, the routing 
information in control block 344 would then be retrieved by following pointer 343. 

Suppose that another routing table entry is to be added for the route 9,67,96.5, using the 
network mask 255.255.255,255. This route is referred to as "R2" in Figs. 3 and 4, and its mask is 
20 referred to as "G2" The value of mask G2 indicates that all 4 address components are significant 
when using the routing table entry for route R2. The more flags corresponding to the first 2 
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address components (flags 302 and 322 in Fig, 3) have already been set to 1 ? so no change is 
required to those array entry values. However, the mask value of G2 used with route R2 
considers all 32 bits of the address value as being significant, and thus it is necessary to look into 
the fourth array 360 as well as the third array 340. Thus, the more flag 342 is now set to on. (As 
5 will be obvious, there is no more flag associated with entries in the final array.) The index entry 
361 associated with the final address component (having the value 5) then has its pointer field 363 
set to point to a control block 364 in which the routing table information associated with this 
route is stored. Because the G2 mask is a specific value for all 32 bits which comprise the 4 
address components, only 1 route uses this R2 information and thus the use count 365 is set to 1. 

10 y f Finally, a third routing table entry is to be added for the route 9.67. 1 1 1 .0, using the 

J% network mask 255.255.240.0. This route is referred to as "R3" in Figs. 3 and 4, and its mask is 

referred to as "G3". The value of mask G3 indicates that the first 20 bits of the DP address are 
□ significant when using the routing table entry for route R3. As discussed above with reference to 
2 1 addition of route R2, no change is needed to the more flags corresponding to the first 2 address 
15 - ? components. Route R3 with its mask G3 corresponds to routes 9.67.96.x through 9.67. 1 1 1 .x, as 
will be obvious to one of skill in the art by evaluating the corresponding bit settings. Thus, entries 
are created in the third array 340 for each route which is not yet represented therein. In 
particular, entries are created for index entries 97 through 1 1 1, as shown at element 351. 
Because the G3 mask value used with route R3 considers the final 12 bits of the address value as 
20 being insignificant, it is not necessary to look into the fourth array 360 and therefore the more 
flags for these entries are set to off. The pointer field 353 for these entries is set to point to a 
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control block 354 in which the routing table information associated with this route is stored. 
Note that R3 is now also an allowable alternative route to Rl (for address 9.67.96.x). Thus, a 
chain of control blocks is preferably created, wherein the control block 344 for Rl appears in the 
chain prior to the control block 354 for R3 (because Rl is more specific than R3). The use count 
5 355 for R3's control block 354 is set to 16 because there are 16 entries (including the entry 96 
and all the entries up through and including entry 111) which point to this control block. 

Figs. 4 A - 4C illustrate examples of looking up routes in the routing tables which have 
been created by the above-described examples referencing Fig. 3, according to the preferred 
II embodiment of the present invention. 

10 11 Suppose a route 9.67. 100. 1 is to be looked up. This example is illustrated in Fig. 4A. 

Using the value 9 as an index entry (shown at 401) into the first array, a more flag 402 which is 
Q set on is found, indicating that the next lower level array should be searched. The value 67 is 
3 1 therefore used as an index (shown at 404) into the second array, where again a more flag 405 
^ 1 which is set on is found. The value 100 is therefore used to index (shown at 406) into the third 
15 array. The pointer field 408 for this array element points to R3. Because the more flag 407 is set 
off, it is not necessary to use the value 1 as an index (shown at 409) into the final array. Thus, R3 
is the final answer. 



As another example, suppose a route 9.67.96. 10 is to be looked up. This example is 
illustrated in Fig. 4B. The more flags associated with index entries 9 and 67 in the first and 
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second arrays, respectively, indicate that more information is to be found in the third array (as 
shown in Fig. 4B at 41 1, 412, 413, 414), Indexing into this third array with the value 96 (shown 
at 415), a more flag 416 set to on is found, along with a pointer field 41 7 which points to the 
control block for Rl. Because the more flag 416 is set, the value 10 is then used as an index 
5 (shown at 418) into the final array. No value is found there, so the control block chain located 
with the pointer field 41 7 in the third array is traversed. Using the Gl mask which is associated 
with the Rl route of the first control block entry on the chain, the mask value is ANDed with the 
route being looked up, yielding a result of 9.67.96,0. Because this route matches the route 
destination (i.e. the result of the AND operation is the same as the result of performing an AND 
10 |i with the Gl mask and the Rl route), this route will be selected. If this route did not match, then 
□J the next entry in the chain would be analyzed similarly, until reaching either a match or the end of 
]l the chain, whichever occurs first. (Note that placing the control blocks on the chain in decreasing 

order of the number of significant bits, as shown in the examples for the preferred embodiment, 
r | eliminates the need to search each entry in the chain to find out which one yields the longest 
15 CP match.) 

As yet another example, suppose route 9.67. 1 12.4 is to be looked up. This example is 
illustrated in Fig. 4C. Again, the more flags assocated with index entries 9 and 67 in the first and 
second arrays indicate that more information is to be found in the third array (shown as elements 
421, 422, 423, 424 of Fig. 4C). Indexing into this third array with the value 1 12 (shown at 425), 
20 no entry is found. Thus, the search is backed up to the second array. Because the pointer for 
entry 67 is null, no match exists here and the search is then backed up to the first array. There 
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is also no match here because the pointer for entry 9 is null The result of this search is therefore 
that there is no explicit route to this destination. (Preferably, a check is then made to determine if 
there is a default route than can be used for this situation, such as 0.0.0.0,) 

As has been demonstrated, the present invention provides a novel technique for lookup up 
IP addresses which is fast and efficient, and which requires a minimal amount of storage. Using 
this technique, lookup time is nearly constant, regardless of the value being looked up. This 
technique may be exploited using multiple processors to further increase lookup speed. The 
disclosed technique may be used equally well with class-specific IP version 4 32-bit addresses and 
with Classless Inter-Domain Routing (CDDR), and easily extends to use with IP version 6 128-bit 
addresses. The technique capitalizes on the fact that the most specific searches are at an array 
associated with the final address component, and the least specific are at an array associated with 
the first address component. Furthermore, it exploits how BP addresses are subnetted, and how 
routes are added for various subnets/hosts for a particular organization. 

While the preferred embodiment of the present invention has been described, additional 
variations and modifications in that embodiment may occur to those skilled in the art once they 
learn of the basic inventive concepts. Therefore, it is intended that the appended claims shall be 
construed to include both the preferred embodiment and all such variations and modifications as 
fall within the spirit and scope of the invention. 
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What is claimed is: 

1 /L A computer program product for providing fast and efficient address lookup for an 

2 address comprised of a plurality of address components and wherein each address component is 

3 deemed to be more significant than its next-sequential neighboring address component, the 

4 computer program product embodied on one or more computer-readable media and comprising; 

5 computer-readable program code means for creating a plurality of arrays comprising an 

6 array for each of the address components, wherein each array comprises a plurality of entries 

7 which are indexed using values of the address component for which the array was created, further 

8 comprising: 

9 11 computer-readable program code means for obtaining a particular address value to 

10 ^? be represented in the plurality of arrays; 

11 j | computer-readable program code means for obtaining a bit mask associated with 

12 ;3 the particular address value; 

13 □ computer-readable program code means for indexing into a highest-order one of 

14 ;f ; the arrays using a most-significant component of the particular address value as an index element; 

1 5 computer-readable program code means for setting a flag associated with the index 

16 element to on if the bit mask indicates that the next-sequential neighboring address component is 

17 considered significant, and for setting the flag to off otherwise; and 

18 computer-readable program code means for repeating the indexing and setting 

19 while the bit mask indicates that the next-sequential neighboring address component is considered 

20 significant, and for (1) storing information associated with the particular address value in a 

21 storage or memory location and (2) setting a pointer field associated with the index element to 
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22 point to the storage or memory location, otherwise; and 

23 computer-readable program code means for retrieving the stored information associated 

24 with a selected address value from the plurality of arrays, further comprising: 

25 computer-readable program code means for obtaining the selected address value; 

26 computer-readable program code means for obtaining a selected bit mask 

27 associated with the selected address value; 

28 computer-readable program code means for indexing into the highest-order one of 

29 the arrays using the most-significant component of the selected address value as the index 

30 element; and 

31 II computer-readable program code means for determining that no result is available 

32 if the index element has no stored information, and for continuing otherwise, wherein the 

33 ji continuing further comprises: 

34 iS computer-readable program code means for checking the flag associated 

35 □ with the index element; and 

36 ^ I computer-readable program code means for returning the stored 

w ^ 

37 information from the storage or memory location pointed to by the pointer field when the flag is 

38 set off or for repeating the indexing and determining when the flag is set on. 

1 2, The computer program product according to Claim 1 , wherein the computer-readable 

2 program code means for repeating further comprises computer-readable program code means for 

3 setting a use count associated with the storage or memory location to a number which represents 

4 a count of the array entries which point to this storage or memory location when the next- 
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sequential neighboring address component is not considered significant. 



3 



1 3 . The computer program product according to Claim 2, wherein the stored information in 

2 the memory or storage location comprises an associated bit mask and wherein the computer- 
readable program code means for retrieving further comprises computer-readable program code 

4 means for resolving a collision, further comprising: 

5 computer-readable program code means for comparing the selected address value to each 

6 bit mask associated with the stored information from multiple storage or memory locations, 

7 yielding a plurality of bit mask results; and 

8 4) computer-readable program code means for selecting a collision result using that one of 

9 the bit mask results which both (1) matches the selected address value according to the selected 
10 bit mask and (2) has the longest associated bit mask. 



1 a 4. The computer program product according to Claim 1, wherein the address is an Internet 

2 ] £ Protocol (IP) address. 



5. The computer program product according to Claim 4, wherein the IP address is an IP 
version 4 address and wherein there are 4 components in each IP version 4 address and thus 4 
arrays. 



1 6. The computer program product according to Claim 4, wherein the IP address is an IP 

2 version 6 address and wherein there are 16 address components in each IP version 6 address and 

RSW9-2000-0050-US1 -24- 



thus 16 arrays. 

A system for providing fast and efficient address lookup for an address comprised of a 
plurality of address components and wherein each address component is deemed to be more 
significant than its next-sequential neighboring address component, the system comprising: 
means for creating a plurality of arrays comprising an array for each of the address 
components, wherein each array comprises a plurality of entries which are indexed using values of 
the address component for which the array was created, further comprising: 

means for obtaining a particular address value to be represented in the plurality of 

4 1 arrays; 

£1 means for obtaining a bit mask associated with the particular address value; 

d I means for indexing into a highest-order one of the arrays using a most-significant 

component of the particular address value as an index element; 

g m eans for setting a flag associated with the index element to on if the bit mask 

;fj indicates that the next-sequential neighboring address component is considered significant, and for 

=; setting the flag to off otherwise; and 

means for repeating the indexing and setting while the bit mask indicates that the 
next-sequential neighboring address component is considered significant, and for (1) storing 
information associated with the particular address value in a storage or memory location and (2) 
setting a pointer field associated with the index element to point to the storage or memory 
location, otherwise; and 

means for retrieving the stored information associated with a selected address value from 
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21 the plurality of arrays, further comprising: 

22 means for obtaining the selected address value; 

23 means for obtaining a selected bit mask associated with the selected address value; 

24 means for indexing into the highest-order one of the arrays using the most- 

25 significant component of the selected address value as the index element; and 

26 means for determining that no result is available if the index element has no stored 

27 information, and for continuing otherwise, wherein the continuing further comprises: 

28 means for checking the flag associated with the index element; and 

29 means for returning the stored information from the storage or memory 

30 II location pointed to by the pointer field when the flag is set offor for repeating the indexing and 

31 determining when the flag is set on. 

1 i8 8. The system according to Claim 7, wherein the means for repeating further comprises 

2 Q means for setting a use count associated with the storage or memory location to a number which 

3 u I represents a count of the array entries which point to this storage or memory location when the 

4 next-sequential neighboring address component is not considered significant. 



1 9. The system according to Claim 8, wherein the stored information in the memory or 

2 storage location comprises an associated bit mask and wherein the means for retrieving further 

3 comprises means for resolving a collision, further comprising: 

4 means for comparing the selected address value to each bit mask associated with the 

5 stored information from multiple storage or memory locations, yielding a plurality of bit mask 
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6 results; and 

7 means for selecting a collision result using that one of the bit mask results which both (1) 

8 matches the selected address value according to the selected bit mask and (2) has the longest 

9 associated bit mask. 

1 10. The system according to Claim 7, wherein the address is an Internet Protocol (IP) address. 

1 11. The system according to Claim 10, wherein the IP address is an IP version 4 address and 

2 wherein there are 4 components in each IP version 4 address and thus 4 arrays. 

. £ \ 

1 ij? 12. The system according to Claim 10, wherein the IP address is an IP version 6 address and 

2 jl wherein there are 16 address components in each IP version 6 address and thus 16 arrays. 

1 £ j y{, A method for providing fast and efficient address lookup for an address comprised of a 

2 [ i I plurality of address components and wherein each address component is deemed to be more 

3 i5S * significant than its next-sequential neighboring address component, the method comprising the 

4 steps of: 

5 creating a plurality of arrays comprising an array for each of the address components, 

6 wherein each array comprises a plurality of entries which are indexed using values of the address 

7 component for which the array was created, further comprising the steps of: 

8 obtaining a particular address value to be represented in the plurality of arrays; 

9 obtaining a bit mask associated with the particular address value; 
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10 indexing into a highest-order one of the arrays using a most-significant component 

1 1 of the particular address value as an index element; 

12 setting a flag associated with the index element to on if the bit mask indicates that 

13 the next-sequential neighboring address component is considered significant, and setting the flag 

14 to off otherwise; and 

1 5 repeating the indexing and setting while the bit mask indicates that the next- 

16 sequential neighboring address component is considered significant, and (1) storing information 

17 associated with the particular address value in a storage or memory location and (2) setting a 

18 pointer field associated with the index element to point to the storage or memory location, 

19 |1 otherwise; and 

20 '^f retrieving the stored information associated with a selected address value from the 

21 ^; plurality of arrays, further comprising the steps of 

22 „ obtaining the selected address value; 

23 Q obtaining a selected bit mask associated with the selected address value; 

24 3 f indexing into the highest-order one of the arrays using the most-significant 

25 issJ component of the selected address value as the index element; and 

26 determining that no result is available if the index element has no stored 

27 information, and continuing otherwise, wherein the continuing further comprises the steps of: 

28 checking the flag associated with the index element; and 

29 returning the stored information from the storage or memory location 

30 pointed to by the pointer field when the flag is set off or repeating the indexing and determining 

3 1 when the flag is set on. 
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1 14. The method according to Claim 13, wherein the repeating step further comprises the step 

2 of setting a use count associated with the storage or memory location to a number which 

3 represents a count of the array entries which point to this storage or memory location when the 

4 next-sequential neighboring address component is not considered significant. 

1 15. The method according to Claim 14, wherein the stored information in the memory or 

2 storage location comprises an associated bit mask and wherein the retrieving step further 

3 comprises resolving a collision, further comprising the steps of: 

4 i|| comparing the selected address value to each bit mask associated with the stored 

5 Of information from multiple storage or memory locations, yielding a plurality of bit mask results; 

6 ^ and 

7 = selecting a collision result using that one of the bit mask results which both (1) matches 

8 □ the selected address value according to the selected bit mask and (2) has the longest associated bit 

9 CP mask. 

1 16. The method according to Claim 13, wherein the address is an Internet Protocol (IP) 

2 address. 

1 17. The method according to Claim 16, wherein the IP address is an IP version 4 address and 

2 wherein there are 4 components in each IP version 4 address and thus 4 arrays. 
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1 18. The method according to Claim 16, wherein the IP address is an IP version 6 address and 

2 wherein there are 16 address components in each IP version 6 address and thus 16 arrays. 

1 Vj^ A method for providing fast and efficient address lookup for an address comprised of a 

2 plurality of address components, the method comprising the steps of: 

3 creating a plurality of arrays comprising an array for each of the address components, 

4 wherein each array comprises a plurality of entries which are indexed using values of the address 

5 component for which the array was created; 

6 storing entries and information for each address to be subsequently looked up, further 

7 :|| comprising the steps of 

V% "v. 

8 01 creating an entry for a particular address using the plurality of arrays; and 

9 storing information associated with the particular address value in a storage or 

10 memory location associated with a last significant component of the entry, wherein the last 

11 r| significant component is determined by a bit mask associated with the particular address; and 

12 01 retrieving the stored information associated with a selected address value from the 

13 ^ plurality of arrays. 
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ABSTRACT OF THE DISCLOSURE 

The present invention provides a method, system, and computer program product for fast and 
efficient IP (Internet Protocol) address lookup. IP address values are interpreted using a separate 
array for each 8-bit component of the address. This technique is therefore exploitable in a multi- 
processing environment. The IP address look-up time when using the present invention is nearly 
constant, regardless of the particular address value. Minimal, if any, locking is required. This 
technique can be used equally well with class-specific IP version 4 32-bit addresses and with 
Classless Inter-Domain Routing (CIDR), and easily extends to use with IP version 6 128-bit 
addresses. 



RSW9-2000-0050-US1 



-31- 



S . Khanna 

RSW9-2000-0050-US1 
1/6 




S . Khanna 

RSW9-2000-0050-US1 
2/6 



FIG. 2 



200 



0 



220 



0 



□ 

m 
m 
a 

j i 

i;; 

Q 
CI 
□! 
CI 
Q 



255 



255 



Info Ptr 



Info Ptr 



240 



260 



0 



0 



270 



d 



RTE 



255 



255 



Info Ptr 



Info Ptr 



result=F(a.b.c.d), 
F=lookup function 



S . Khanna 

RSW9-2000-0050-US1 
3/6 



FIG. 3 



300 



320 



301 















* m « 




MFIag=1 


null 











321. 















■ « ■ 




MFIag=1 


null 











Info Ptr 



Info Ptr 



341 



342 



351 



^6 
97 
98 
99 

111 



MFIag=1 



MFIag=C 



MFIag=C 



MFIag=C 



MFIag=C 



340 



R1 



R3 



R3 



R3 



R3 



343 



344 



361 
\ 

354 5 



R1 



R3 



u=1 
v 345 

-353 



u=16 
v 355 



360 











m m m 






R2' 



















363 



364 




365 



Info Ptr 

Route R1: 9.67.96.0 
Route R2: 9.67.96.5 
Route R3: 9.67.111.0 



Info Ptr 



255.255.255.0 G1 
255.255.255.255 G2 
255.255.240.0 G3 



S . Khanna 

RSW9-2000-0050-US1 
4/6 



FIG. 4A 



300 



o 
•4) 

□ ! 

CI 
4$ 

a 
□ 

a 



401 



402 
* 9 



407 
406 



96 
97 



H00 



111 















■ m m 




^1 


null 











320 



Info Ptr 



340 











■ ■ ■ 




1 


R1 


0 


R3 


■ ■ ■ 




^0 


R3 - 


■ ■ ■ 




0 


R3 







R1 



u=1 
408 



Info Ptr 



404 















■ * ■ 






null 










Info 


Ptr 

360 



409 



*1 



R3 



u=16 











mum 






R2 



















R2 



Lookup 9.67.100.1 
Result: useR3 



Info Ptr 



S . Khanna 

RSW9-2000-0050-US1 
5/6 



FIG. 4B 



300 



□ 

4) 

fl! 

01 
C! 

;= 
cs 

CI 
01 
CI 
□ 



411 



412 
*9 















■ ■ ■ 




"^1 


null 











415 



416 

-96 

97 



100 



111 



0 



0 



0 



Info Ptr 



340 



R1 



R3 



R3 



R3 



.417 



R1 



u=1 



Info Ptr 



320 



413 















mam 






null 










Info Rr 

360 



R3 



u=16 



418 



*10 











■ ■ ■ 






R2 



















R2 



u=1 



Lookup 9.67.96.10 
Result: use R1 



Info Ptr 



S . Khanna 

RSW9-2000-0050-US1 
6/6 



FIG. 4C 
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